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METHOD FOR MO LECULE-MOLECULE INTERACTIONAL 
Technical field 

The invention relates to a method for labelling and analysing molecule-molecule 
interactions, preferably by mass spectroscopy. Furthermore, the invention relates to 
a kit for use in the method. 

Background 

The success of the genome projects has resulted in the identification of a vast num- 
ber of open reading frames (ORFs), which potentially code for proteins. The main 
problem is now to assign functions to the forty to sixty percent of the ORFs in a ge- 
nome for which no function can be allocated. Two analytical chemical approaches 
to this can be identified: transcriptomics and proteomics. Transcriptomics analyses 
the expression levels of the various mRNA species being transcribed using either a 
'gene-chip' or a SAGE approach. This is useful for identifying under which condi- 
tions a particular protein is being expressed but provides little direct information as 
to function. Proteomics, the direct quantitation and analysis of expressed proteins, 
provides a more direct approach to function definition. Proteomics can be divided 
into two areas; Expression Proteomics attempts to define all the proteins being ex- 
pressed in a cell and their post-translational modifications and how these change 
under various conditions. Cell-Map Proteomics attempts to define the subcellular 
location of a protein and with which other proteins it is interacting. It is this field 
which the inventors wish to address in this grant application. 

Traditionally, protein-protein interactions have been analysed by the isolation of 
protein complexes by 'soft 1 non-denaturing physico-chemical methods such as cen- 
trifugation or affinity based isolations. This approach has been facilitated by the use 
of mass spectrometry to analyse the purified protein complexes either as mixtures or 
after separation by SDS-PAGE. The method suffers from several drawbacks, the 
main one being the stability of the complex under the conditions of purification and 
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a lack of a general purification approach to allow a systematic analysis of many 
proteins. The latter problem has been successfully addressed by the development of 
the TAP procedure by the group of Bernard Seraphin at EMBL (Rigaut et al., Nat 
Biotechnol 1999 Oct; 17(10): 1030-2 and Puig et al., Methods 2001 July; 24(3): 
218-29). ORFs are modified so that the proteins they encode contain two affinity 
purification (TAP) tags, which allow the labelled protein to be rapidly purified to 
homogeneity. The resulting complex is then analysed by mass spectrometry to iden- 
tify the co-purifying proteins. 

An alternative approach to protein-protein interaction analysis has been the devel- 
opment of the yeast and bacterial two- and three-hybrid systems (Fromont-Racine et 
al., Nat Genet 1997 Jul; 16(3): 277-82 and Uetz et al., Nature 2000 Feb 10; 
403(6770): 623-7). This has allowed genome-wide scans of all protein-protein in- 
teractions in a genome to be carried out. The main drawback here is that transient 
interactions and those induced by ligands or phosphorylation are not amenable to 
analysis by this method. The TAP purification and two-hybrid system methods do 
not allow one to define which protein is interacting with which other protein in the 
complex and which parts of the two proteins are involved in this interaction. 

EP 0 778 280 (Isis Innovation Limited) relates to a reagent for use in biological and 
chemical analyses. More specifically, the reagent is comprised of at least two ana- 
lyte groups linked to a tag comprising one or more reporter groups adapted for de- 
tection by mass spectrometry (MS). More specifically, the group for MS detection is 
a tertiary amino group, which increases sensitivity and which does not allow gen- 
eration of a specific ion for parent ion scanning. Hence, the disclosed reagent cannot 
be used in parent ion scattering. 

Further, WO 00/02893 (Brax Group Limited) relates to a method of characterising 
an analyte, which method comprises to provide a compound in which the analyte is 
attached by a cleavable linker to a mass marker relatable to the analyte; to cleave the 
mass marker from the analyte; and to identify the mass marker. More specifically, 
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the marker is a metal ion-binding moiety. To achieve a high ionisation, the labels 
disclosed are pre-ionised. 

Accordingly, there is a need for alternative methods in the field of proteomics, 
solving the posed problems and providing new opportunities to gain more detailed 
information about molecule-molecule interactions. 

Summary of the invention 

One object of the invention is to provide a method meeting the demands on this 
point. This and other objects are accomplished by a method for labelling a target 
biomolecule, interacting with a specific biomolecule of interest, also denoted bait, 
as disclosed in the first claim of the application. Hereby, the target biomolecule is 
labelled with a unique mass marker, which preferably is detected by mass spectros- 
copy, using a parent ion scanning mode. The method has a wide applicability, al- 
lowing the study of interactions between several types of biomolecules. A specific 
object of the invention is to provide means for the study of interactions between 
proteins and small molecules or ligands. A further object of the invention is to pro- 
vide a kit for use in the method of the invention. 

An advantage with the present invention is that it enables focus on transient and low 
affinity protein-protein (ligand) interactions that the conventional methods described 
in the introduction do not allow. The inventor discloses herein a mass spectrometric 
approach that will enable one to pick out the 'target protein 1 to which the labelled 
bait protein has been crosslinked. The protein of interest, the bait is modified with a 
chemical or photoactivatable linker either in situ or externally and then introduced 
into the cells. The bait can then be cross-linked by photolysis to the target under de- 
fined conditions. The cell is then lysed and the crosslinker cleaved to leave a unique 
chemical label (the mass marker) on the target, which allows it to be rapidly identi- 
fied by 'parent ion' scanning in a mass spectrometer. The method 'mass marker 
transfer' is applicable not only to protein-protein interaction mapping but also to 
detennining the targets of small organic (or inorganic) molecules (ligands) in the 
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cell. This approach has the advantage in that it not only enables one to identify the 
interacting partner but also to determine the domain on the protein responsible for 
the interaction. The method is ideally suited to the analysis of rapid transient inter- 
actions such as those that occur in signalling cascades as well as for the identifica- 
tion of receptors for small molecules such as drugs or signalling metabolites. 

Other objects and further advantages of the present invention will appear from the 
detailed disclosure that follows. 

Brief description of the drawings 

Figure 1 shows the general form of linker molecule of the invention. 
Figure 2 shows the chemical structure of a specific linker of the invention. 
Figure 3 is a schematic outline of the principle of the mass marker transfer method 
of the invention. 

Figure 4 shows the parent ion scanning principle. 

Figure 5 shows a daughter ion spectrum and HPLC UV vs. RIC trace. 

Figure 6 is a photo of a diagonal gel electrophoresis run. 

Definitions 

By "a target biomolecule" is in this context meant the molecule which is desired to 
find and analyse. 

By the term "interacting biomolecule" is meant a biomolecule that can attach to a 
target by chemical binding, ionic interaction, hydrogen bonding, affinity adsorption 
or any other principle that couple one biomolecule to another. The interaction may 
well be based on more than one of the above mentioned principles. 
By "a biomolecule of interest", or "bait" is meant a molecule potentially having 
specificity for the target biomolecule. It is the interaction between the biomolecule 
of interest and the target biomolecule that is desired to monitor by the invention. 
By "linker molecule" is meant a molecule, which is used for cross-linking the bio- 
molecule of interest and the target biomolecule. The linker molecule comprises "an 
attachment part" for binding to the biomolecule of interest, "a photoactivatable 



WO 03/056342 



5 



PCTYEP02/14315 



part", which has the ability to be activated and thereby be able to bind to the target 
biomolecule, "a cleavable part", which may be cleaved during the analysis step of 
the invention, and "a mass marker part" which provides a unique mass marker for 
the subsequent analysis. 

Detailed description of the invention 

A first aspect of the invention is a method for identifying an interacting target bio- 
molecule to a biomolecule of interest comprising the steps of: 

(a) providing a biomolecule of interest having specificity for the target; 

(b) binding the biomolecule of interest to at least one type of linker molecule, the 
linker molecule comprising at least one attachment part for binding to the 
biomolecule of interest, one cleavable part, one mass marker part and one 
photoactivatable part, for binding to the target; 

(c) contacting the biomolecule of interest with a cell or a cell extract; 

(d) exposing the cell to photolysis, whereby the photoactivatable part binds to 
the target; 

(e) cleaving the linker molecule, thereby leaving the photoactivatable part and 
the mass marker part bound to the target; 

(f) analysing the product of step (e), thereby detecting the mass marker part, 
thus identifying the interacting target biomolecule to the biomolecule of in- 
terest. 

The target biomolecule is a polypeptide, a protein, a nucleotide, a small molecule, 
or any other biomolecule, such as a fatty acid or a carbohydrate. Preferably, the tar- 
get biomolecule is a protein. 

As a first step of the invention the biomolecule of interest is provided. Preferably, 
the biomolecule of interest is provided in a concentration as close as possible to that 
as is naturally found in the cell type being analysed. The method for isolating the 
biomolecule of interest depends on the nature of the molecule. For instance, drug 
molecules may be provided by chemical synthesis, and proteins by expression and 
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purification. The biomolecule of interest is any biomolecule, small molecule or li- 
gand, which potential interaction with another biomolecule is desired to study. Ac- 
cordingly, if its interaction with a specific target is already known, the present 
method can be used to determine whether such a target is present in a specific cell 
This can for example be used in diagnosis of disease, where the target then is a 
kaown marker of a disease condition. Alternatively, various ligands, such as a com- 
binatorial library of peptides, may be tested for their ability to bind specific target 
known to be present in a cell. This can for example be used in the screening of new 
drugs or drug candidates. In one embodiment the biomolecule of interest is selected 
from the group that consists of a polypeptide, a protein, a nucleotide, a small or- 
ganic or inorganic molecule, a fatty acid and a carbohydrate. Preferably, the bio- 
molecule of interest is a peptide, or a small molecule. In this context, the term 
"small" is used for molecules sufficiently small for the herein-described use. The 
minimum size of the biomolecule of interest is what is required for efficient bind- 
ing. For instance, this may be tried out using a series of different sizes, whereby the 
best is chosen. The affinity of the biomolecule of interest for the target biomolecule 
should be sufficiently strong for the binding to last long enough for the herein dis- 
closed purpose, i.e. to enable an identification of a target, and may for example be 
in the raM to pM range, for example 10 mM to 0. lpM. 

Thereafter, in step b, the biomolecule of interest is bound, preferably by photoacti- 
vation under conditions as close to native as possible, to a linker molecule. The 
linker molecule comprises an attachment part for binding to the biomolecule of in- 
terest. In one embodiment, the attachment part of the linker molecule is designed to 
bind a specific amino acid residue of the biomolecule of interest. In a preferred em- 
bodiment, the attachment part is a N-hydroxysuccinimide moiety or a N-maleimide. 

Furthermore, the linker molecule comprises a photoactivatable part, for subsequent 
binding to a target biomolecule, a mass-marker part, for allowing analysis of the tar- 
get, and a cleavable part, for separation of the target and the agent. 
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In the next step, step c, the biomolecule of interest is introduced in a cell either by 
active uptake, such as by pinocytosis, or by permeabilising the cells temporarily, 
e.g. by digitonin. Alternatively, a cell free system may be used, especially if one or 
more of the present biomolecules are carbohydrates. Also, the cell may be perfo- 
rated or in the form of a cell extract. The mixture obtained of biomolecule of inter- 
est and cell or cell extract is the allowed a sufficient period of time for the desired 
binding to occur. In one embodiment where the target biomolecule is a nucleotide, 
the method also either provides entrance thereof into a nucleus of a cell or alterna- 
tively the interacting biomolecule is contacted with a disrupted nucleus. 

Alternatively, the experiment may be carried out in cell extracts. 

Thereafter, the cell or the cell-free system is exposed to photolysis. The system is 
preferably kept at constant temperature, and any standard UV lamp is useable. Pref- 
erably, a tungsten carbide lamp is favoured after filtering to remove far UV, which 
is done by passing the light through a 1M copper sulphate solution (path length 1 
cm). Hereby, the photoactivatable part of the linker molecule is activated, thereby 
allowing it to bind to the target biomolecule. In one embodiment, the photoacti- 
vatable part is an azide or a benzophenone. Benzophenone may need repeated pho- 
toactivations in order to bind to the target. However, if repeated activations are per- 
formed, the probability for the benzophenone-part to bind to the target may be as 
high as 80 %. 

If the activatable part is a compound that can be activated by chemical means, the 
activation is provided by adding such a suitable chemical. Chemical activation is 
well known in the field of biochemistry, and the skilled person can easily choose a 
suitable combination of chemically activatable part/chemical degrador or cleaver. 

One advantage of activating the above discussed part for binding after it has entered 
the cell is that undesired unspecific binding with other molecules will then be 
avoided. However, the above discussed part may have been activated to be able to 
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bind a target before being contacted with the cell, as long as the binding to the target 
is sufficiently specific for the method to be functional. 

Furthermore, in order to achieve an adequate degree of binding, i.e. one that lasts 
throughout the present procedure and one that can withstand the conditions used, 
wherein normally about 10% binding degree is necessary, target linker molecules 
having varying lengths may be used in the invention in order to secure that at least 
some of the linkers bind to the target. Moreover, if the linker molecule is very long, 
it may tend to bind water, thereby limiting its activity for the target. The reason for 
the linker molecule to be used in varying lengths, is that due to the nature of the 
method, the site where the linker molecule may bind to the target as well as the 
parts of the biomolecule of interest and the target biomolecule that interact to one 
another, are unknown at forehand. Thus, it is desired to provide a linker molecule, 
which has the ability to bind to the target biomolecule, even though there might be 
some distance between the binding site to the biomolecule of interest and to the tar- 
get biomolecule. Moreover, by using linker molecules having varying lengths, in- 
formation about the naturally occurring interaction between the bait and the target 
may be provided; i.e. by studying what length of the linker molecule is optimal for 
binding to the target, information about distances between interacting parts of the 
target and the bait may be revealed. 

As mentioned above, in a specific embodiment, the present biomolecules are nu- 
cleotides, in which case the above described linker is tailored to link a nucleotide to 
another nucleotide, while keeping the feature of being cleavable as described above. 
Similarly, in alternative embodiments, the biomolecules are carbohydrates, and the 
linker can link a carbohydrate to another carbohydrate. Especially advantageous 
embodiments are when the biomolecules i.e. the target and the biomolecule of inter- 
est are of different kinds, in which cases the linker is capable of providing e.g. nu- 
cleotide-protein crosslinking, protein-carbohydrate etc. 
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Subsequently, in step (e), the linker molecule is cleaved, thereby releasing the bio- 
molecule of interest from the target biomolecule, and leaving the part of the linker 
molecule comprising the mass marker bound to the target. The cleaving of the 
cleavable part of the linker molecule may be performed by chemical means. In one 
embodiment the cleavable part is cleaved by an oxidising agent or by a base agent. 
More specifically, the cleavable part may be a geminal diol or an ester linkage, 
which can be cleaved by mild oxidation with 10 mM periodate, e.g. for 30 minutes 
at room temperature, or basic conditions, e.g. at a pH > 9 for two hours at room 
temperature, respectively. 

Optionally, the product of step (e) of the invention, i.e. the target biomolecule, may 
be cleaved by enzymatic means either separately or combined with the above- 
discussed chemical means. For instance, the digestion may be performed by using 
cyanogen bromide and/or trypsin. The cleaving can comprise an enzymatic diges- 
tion, such as with an enzyme, such as a protease (e.g. trypsin, V8 protease, such as 
Staphylococcus aureus V8 protease, LysC, AspN etc) or a glycosidase, or a chemi- 
cal digestion, such as with cyanogen bromide. However, as regards membrane 
and/or membrane associated proteins, due to their compact structure and tendency 
to aggregate when denatured, conventional enzyme digestions can be found to be 
inefficient. In one embodiment which is especially advantageous for membrane 
and/or membrane proteins, the cleaving in step (b) is an enzymatic digestion pre- 
ceded by addition of a digestive chemical, such as cyanogen bromide. More specifi- 
cally, the present inventors have used a scheme wherein the proteins are first di- 
gested with cyanogen bromide in a powerful solvent, such as 70% formic or trifluo- 
roacetic acid, with or without hexafluoropropanol. This generates medium sized 
fragments which can be readily solubilised by a conventional method, e.g. in 1% 
SDS, before dilution to about 0.01% and digestion with LysC protease. In an alter- 
native embodiment, acid-based cleavages are used, as reported by the group of Tsu- 
gita (Kamo et al. 1998 and Kawakami et al. 1997). Thus, in one embodiment, the 
cleaving in step (b) is a serine/threonine cleavage with a fluorinated acid. In a de- 
tailed embodiment, site specific cleavage at serine and threonine is carried out in 
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peptides and proteins with S-ethyltrifluorothioacetate vapour as well as at aspartic 
acid residues by exposure to 0.2% heptafluorobutyric acid vapour at 90°C. Such a 
serine/threonine cleavage method is advantageous, since Ser and especially Thr are 
found often in transmembrane segments. In summary, the skilled in this field can 
select the most appropriate method to cleave the proteins in the sample depending 
on factors such as the source of the sample, the purpose of the labelling etc. The di- 
gested proteins obtained according to the present invention are much easier to han- 
dle since physicochemically they are much simpler. Thus, an essential advantage 
with the present invention is that the separation of peptides obtained according to 
the invention can be selected to pick out virtually any one or ones of those present 
in the original sample as proteins, since the present digestion will be essentially to- 
tal. Accordingly, in the step of separation and the subsequent labelling, any one of 
all possible peptides, i.e. fragments of proteins, can be treated, even cysteine- 
containing peptides, as will be discussed in more detail below. This should be com- 
pared to the prior art methods, wherein proteins can be hidden or concealed due e.g. 
to self-aggregation. Prior methods required the separation of intact proteins and 
could not deal with peptide digests without losing the quantation aspect. The present 
method of cleavage provides homogenous peptides, which can be separated without 
the problems associated with proteins have multiple domains (hydrophobic and hy- 
drophilic) which cause them to run at multiple positions. The present digestion 
method also allows the analysis of proteins that are otherwise completely insoluble 
or are parts of large complexes, which cannot be easily separated, especially cyto- 
skeletal aggregates or proteoglycans. 

The purpose of the next step of the method, step (f), is to separate and analyse the 
target biomolecule with its bound mass marker part. Normally, this is performed by 
coupling an initial multidimensional HPLC to a mass spectrometer (MS). In one 
embodiment of the present method, the separation is by multi-dimensional chroma- 
tography. In another embodiment of the present method, standard reverse phase 
HPLC is used to separate the majority of the peptides. In a specific embodiment 
which is efficient if it is desired to get the most hydrophobic peptides, a hydrophjlic 
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interaction chromatography (HILIC) approach is used. Alternatively, a first dimen- 
sion separation can he carried out by ion exchange in the presence of a detergent 
such as octylglucoside as demonstrated previously (James P, Inui M, Tada M, Chi- 
esi M, Carafoli E. The nature and site of phospholamban regulation of the Ca2+ 
pump of sarcoplasmic reticulum. Nature. 1989 Nov 2;342 (6245):90-2; James, P., 
Quadroni, M, Carafoli, E., and Gonnet, G. (1993) Biochem.Biophys.Res.Comm. 
195, 58-64. Protein identification by mass profile fingerprinting). The detergent is 
then easily removed prior to RP-HPLC-MS analysis by using a normal phase pre- 
column. 

Moreover, in a preferred embodiment of the invention, the mass marker part has the 
ability to fragment during the analysis step. For example, the mass marker part is 
thioethyl-pyridine. Hereby, a unique mass marker is achieved, which may easily be 
detected by mass spectrometry. In a preferred embodiment, the MS is in a parent ion 
scanning mode (Anal Chem 1996 Feb l;68(3):527-33. Parent ion scans of unsepa- 
rated peptide mixtures. Wilm M, Neubauer G, Mann M. and Carr et al. 1993, Anal 
Chem 1993 Apr l;65(7):877-84 Collisional fragmentation of glycopeptides by 
electrospray ionisation LC/MS and LC/MS/MS: methods for selective detection of 
glycopeptides in protein digests. Huddleston MJ, Bean MF, Carr SA) (also de- 
scribed in the example section), wherein target biomolecules comprising the 
thioethyl-pyridine mass marker easily are detected at 106 m/z. Furthermore, in a 
preferred embodiment the MS operates in a data-dependent mode, thereby switching 
from parent ion to daughter ion scanning mode when a target biomolecule contain- 
ing the marker is detected. 

In yet another embodiment, the present method comprises the method described 
above, which further comprises the step of identifying the amino acid sequence of at 
least one of the labelled peptides. 

In one embodiment, amino acid sequence identification step is by mass spectral 
analysis using an ion trap spectrometer or a quadrupole time of flight (TOF) instru- 
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ment. However, as is realised by the skilled in this field, any MS instrument capable 
of carrying out and measuring peptide fragmentation spectra can be used to this end. 

Moreover, the amino acid identification may be followed by a data base search, in 
order to find homologues, or other relevant information, to the identified sequence. 
This may be done in order to assign a probable function for the identified sequence. 

In yet another embodiment, the linker molecule may comprise a fluorescent protein 
tag, in order to make it possible to determine its location in the cell, to which it is 
introduced. Furthermore, the linker molecule may comprise a signal tag, directing it 
to a specific location or compartment of a cell. 

Another aspect of the invention is a linker molecule, which is especially suitable for 
use in the method of the invention for cross-linking the biomolecule of interest and 
the target biomolecule. One can envision a wide range of possible crosslinkers that 
could be useful. Figure 1 shows the principal functional parts of such a molecule. 
Each position can be tailored to meet a variety of needs; the chemical attachment 
group could be an N-hydroxysuccinimide moiety for modification of lysine residues 
or an N-maleimide for cysteine labelling. The cleavable group could be a geminal 
diol for cleavage by oxidising agents or an ester linkage for base cleavage. Similarly 
the photoactivatable group could be an azide for rapid labelling or a benzophenone 
for high efficiency crosslinking. However, the mass transfer marker that will be 
used in all cases however will preferably be thioethylpyridine. The thioether bond is 
chemically extremely stable however it fragments readily under standard low energy 
MS/MS conditions in a triple quadrupole mass spectrometer. The group leaves as a 
positively charged ion with m/z of 106. This mass does not correspond to any stan- 
dard fragment found during low energy fragmentation of peptides and thus provides 
a unique marker or tag for the peptide to which it is attached. In one preferred em- 
bodiment, the linker molecule is 2-benzophenon-4-yl-carbonylamino-4, 5- 
dihydroxy-6-(N-succinimidyl)-l-(4-pyridylethyltW (figure 2). 

However, many variants of this linker are possible. It should comprise the attach- 
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ment part, the cleavable part, the label part and the photoactivatable part, but they 
should mostly be seen as functional parts, and must not necessarily be structurally 
distinguished from each other. The core part of the linker molecule is its ability to 
render a unique mass marker in the gas phase, making it suitable for parent ion 
scanning, which feature is a result of the thioether bridge. 

One essential advantage of this approach compared to standard cross-linking meth- 
ods using radioactive transfer markers such as the Denny- Jaffe reagent (Denny, J. B. 
and G. Blobel (1984). "1251-labeled crosslinking reagent that is hydrophilic, photo- 
activatable, and cleavable through an azo-linkage." Proc Natl Acad Sci U S A 
81(17): 5286-90) is that the crosslinked peptide is unequivocally identified during 
parent ion scanning. Radioactive markers have the problem that the peak of radio- 
activity occurs in an HPLC fraction together with many other peptides and so multi- 
dimensional chromatography is necessary. Also the detection of the cross-linked 
product and the identification by sequencing occur concurrently. 

Yet another aspect of the invention is a kit for use in the method of identifying an 
interacting target biomolecule to a biomolecule of interest according to the present 
invention, which comprises, in separate compartments, at least one linker molecule, 
and optionally the biomolecule of interest. Furthermore, the kit may comprise nec- 
essary reagents for the different steps of the method of the invention, as discussed 
above or in the example section. The kit comprises amounts of the reagents, which 
is sufficient for performing the method of the invention. In addition, the kit can also 
comprise written instructions for its use. 

In one embodiment, the present kit comprises a biomolecule of interest in a suitable 
buffer, a linker molecule as described above, and a cleaving agent that can cleave a 
part of the linker under appropriate conditions, each component being present in a 
separate compartment. In one embodiment, the linker comprises a part that can be 
photo-activated to enable it to bind to a suitable target, as described above. In an 
alternative embodiment, said part of the linker that can be activated by chemical 
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means, in which case the kit also comprises a suitable substance for providing such 
activation. In an advantageous embodiment, the present kit comprises an interacting 
biomolecule bound to a linker, and optionally a chemical compound capable of acti- 
vating a part of the linker to enable binding thereof to a target biomolecule. 

Still another aspect of the invention is the use of a linker molecule as described 
above for labelling the target biomolecule in the method of the invention. 

Detailed description of the drawings 

Figure 1 shows the general form of cross-linker molecule of the invention compris- 
ing chemical attachment group, cleavable linker, mass marker and photoactivatable 
group. 

Figure 2 shows the chemical structure of a specific linker of the invention, namely 
2-benzophenon-4-yl-carbonylamino-4, 5-dihydroxy-6-(N-succinimidyl)- 1 -(4- 
pyridylethylthio)-3-n-hexanone. 

Figure 3 is a schematic outline of the principle of the mass marker transfer method 
of the invention including the steps of photolysis and chemical cleavage. 
Figure 4 shows the parent ion scanning principle. Capillary HPLC, electrospray 
source, a tube with collision gas and detector are shown. 

Figure 5 shows a daughter ion spectrum and HPLC UV vs. RIC trace, with relative 
ion count / relative UV absorption on the Y-axis and time on the X-axis. 
Figure 6 is a photo of a diagonal gel electrophoresis run according to conventional 
procedure. 

The invention will now be described with the following examples, which are only 
intended to be of exemplifying character, and therefore not limiting the scope of the 
invention in any way. All references given below and elsewhere in the present ap- 
plication are hereby included herein by reference. 
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EXPERIMENTAL PART 
Example 1: The principle of the method of the invention 

To illustrate the invention, the principle of the method is shown in Figure 3. The 
linker is attached to lysine residues on the bait protein. The protein is then intro- 
duced to a perforated cell or a cell extract and allowed to equilibrate before photoly- 
sis. After crosslinking the proteins are cleaved by oxidisation and the mixture is first 
digested with cyanogen bromide and then by trypsin. 

The complex peptide mixture is then analysed by multi-dimensional HPLC inter 
faced by electrospray ionisation to a mass spectrometer operating in parent ion 
scanning mode as shown diagrammatically in Figure 4. By operating the mass 
spectrometer in parent ion scanning mode, only those peptides that give rise to an 
intense ion at 106 m/z will be detected. The mass spectrometer can be programmed 
to operate in a data-dependant mode, switching from parent ion to daughter ion 
scanning mode once a peptide containing the marker is detected. Figure 5 shows a 
preliminary experiment that has been carried out to validate the general principle of 
the approach. A synthetic calmodulin-binding peptide was modified with the 
crosslinker and photolysed in the presence of calcium and calmodulin. The mixture 
was then digested with cyanogen bromide followed by trypsin and the peptides 
separated by HPLC. The UV trace in Figure 5 shows that the expected number of 
peptides are generated whereas the parent ion total ion current trace shows that only 
one peptide has been labelled by the mass marker transfer method. 

Initially the inventors will synthesis again the molecule shown in Figure 2 and later 
they will extend this to a range of photoactivatable probes specifically tailored to 
certain types of problem (such as nucleotide-protein crosslinking or protein- 
carbohydrate for example). 

Having synthesised the crosslinker molecule of Figure 2 (2-benzophenon-4-yl- 
carbonylamino-4, 5-dihydroxy-6-(N-succinimidyl)- l-(4-pyridylethylthio)-3-n- 



WO 03/056342 PCT7EP02/14315 

16 



hexanone) it is tested using the calmodulin system mentioned briefly above. Calmo- 
dulin is an 18 kDa calcium binding protein involved in calcium signal transduction 
in many cells. Upon binding calcium it undergoes a conformational change allow- 
ing it to bind a specific domain on its target proteins. The structure of calmodulin 
alone and as a complex with several of its targets has been elucidated both by X-ray 
crystallography and NMR and hence will allow the inventors to validate the results 
they obtain from the cross-linking experiments. Calmodulin-binding domains from 
four different proteins have been synthesised. 

Example 2: Site-specific introduction of probes into proteins 

In order for the present method to be of general use, one should be able to locate the 
crosslinker on the bait protein at specific locations, especially if a protein interacts 
with more that one target. The inventor uses a cell-free translation system that is 
commercially available, namely the Roche rapid translation system (RTS). The N- 
maleimide derivative of Figure 2 is synthesised and attached to cysteine. This in 
turn can be coupled to a tRNA with a defined codon specificity as described by 
Josef Brunner's group and more recently that of Peter Schultz. Site-specific incorpo- 
ration of the crosslinker is rapidly achieved and commonly 100 \ig quantities can be 
prepared overnight. The protein carries two affinity tags and can be rapidly purified. 
The protein must then be introduced into the cells of interest by either active uptake 
or by permeabilising the cells temporarily by digitonin. Alternatively the experiment 
may be carried out in cell extracts. Once the cDNA encoding the protein is correctly 
engineered into the carrier plasmid, one can rapidly produce mutants by PCR with 
the codon for the modified tRNA. Thus, tens of proteins modified at different posi- 
tions can according to the invention be produced within a week once the system is 
set-up. 

Example 3: Methods for the analysis of large complexes 

If a complex is very large another approach must be taken for the introduction of 
crosslinkers into the bait proteins. According tot he invention, crosslinkers are in- 
troduced in a random fashion by chemical means. A very gentle method of modifi- 
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cation for crosslinking was introduced by the group of Traut to map protein-protein 
interactions in the ribosome. Iminothiolane, a reagent that reacts with lysine resi- 
dues to give a free sulphydryl group in their place, was used to modify the intact 
complex under mild conditions. Oxygen was then bubbled into the solution causing 
neighbouring sulphydryl groups to crosslink to form disulphide bridges. The com- 
plex was then separated by diagonal gel electrophoresis, in which the first dimen- 
sion is non-reducing SDS PAGE and the second is done under reducing conditions. 
Non-crosslinked proteins appear along the diagonal axis at a position proportional to 
their mass whereas crosslinked proteins appear as off diagonal vertically separated 
pairs as shown in Figure 6. This method can be modified slightly to incorporate a 
mass marker introduction step between the first and second dimension. Thus, all 
interacting pairs appearing off-diagonal can be rapidly identified by protein frag- 
ment fingerprinting (James, P., Quadroni, M, Carafoli, E., and Gonnet, G. (1993) 
Biochem.Biophys.Res.Comm. 195(1), 58-64. Protein identification by mass profile 
fingerprinting.) and their sites of interaction can be analysed simultaneously by the 
parent ion scanning method outlined in Figures 4 and 5. 

Example 4: High throughput interaction domain mapping 

An approach to mapping domain interactions between proteins can be taken which 
is similar to that used for mapping epitopes (Houghten, R. A. (1985). "General 
method for the rapid solid-phase synthesis of large numbers of peptides: specificity 
of antigen-antibody interaction at the level of individual amino acids." Proc Natl 
Acad Sci U S A 82(15): 5131-5.). The inventor synthesises a series of 20mer pep- 
tides which cover the entire sequence of a protein with 10 amino acid overlaps (i.e. 
100 peptides are needed for a 100 kDa protein). Each of the peptides will have a 
crosslinker at its N-terminal and a long cleavable spacer arm at its C-terminal sepa- 
rating it from the supporting resin. The synthesis is carried out by a standard multi- 
parallel robotic system (e.g. that of Advanced ChemTech amongst many others) in a 
96 well plate format. The known target protein, which has been labelled with a fluo- 
rescent group, is then added in the appropriate buffer to the wells and allowed to 
equilibrate. The wells are then washed under progressively more stringent condi- 
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tions and the fluorescence in each well determined after each round. Finally, the 
better binding peptides are then re-equilibrated with target protein and photolysed. 
The binding position of each peptide on the target is then determined and a domain 
interaction map is constructed. The procedure is then reversed and the bait protein is 
fluorescently labelled and used to screen the peptides from the target and the bind- 
ing results are correlated with the first map to exclude non-specific interactions. 



